Abstract

In this correspondence, we study the minimum pseudo-weight and minimum pseudo-codewords
of low-density parity-check (LDPC) codes under linear programming (LP) decoding. First, we
show that the lower bound of Kelley, Sridhara, Xu, and Rosenthal on the pseudo-weight of a
non-zero pseudo-codeword of an LDPC code with girth greater than 4 is tight if and only if
this pseudo-codeword is a real multiple of a codeword. Then, the lower bound of Kashyap and
Vardy on the stopping distance of an LDPC code is proved to be also a lower bound on the
pseudo-weight of a non-zero pseudo-codeword of an LDPC code with girth 4, and this lower
bound is tight if and only if this pseudo-codeword is a real multiple of a codeword. Using
these results, we further show that for some LDPC codes there are no minimum
pseudo-codewords other than the real multiples of minimum weight codewords. This means
that LP decoding for these LDPC codes is asymptotically optimal, in the sense that the ratio
of the decoding error probabilities of LP decoding and maximum-likelihood decoding
approaches 1 as the signal-to-noise ratio (SNR) tends to infinity. Finally, some LDPC codes
are listed to illustrate these results.

Index Terms: LDPC codes, linear programming (LP) decoding, fundamental
cone, pseudo-codewords, pseudo-weight, stopping sets.
I Introduction



In the study of iterative decoding of low-density parity-check (LDPC) codes, Wiberg
[28] and Koetter and Vontobel [12] showed that pseudo-codewords play an important
role when characterizing the performance of LDPC codes. Koetter and Vontobel [12]
presented an explanation for the relevance of pseudo-codewords in iterative decoding
based on graph covering and showed that the set of pseudo-codewords can be described
by the so-called fundamental polytope. Recently, linear programming (LP) decoding
of linear codes was introduced by Feldman, Wainwright and Karger [3], [4]. The feasible
region of the linear programming problem in LP decoding [3], [4] agrees with the
fundamental polytope. It is known that when characterizing the performance of linear codes
under LP decoding, pseudo-codewords, especially the pseudo-codewords with minimum
pseudo-weight (or minimum pseudo-codewords for short), also play an important role.
In [2], Di et al. showed that the performance of an LDPC code under message passing
decoding over a binary erasure channel is closely related to the stopping sets in the
factor graph. Since the support of any pseudo-codeword is a stopping set [12], the
minimum pseudo-codewords and the nonempty stopping sets of smallest size are
related [6], [20], [29].

Recently, pseudo-codewords and minimum pseudo-weights of binary linear codes 
have been studied in pQ, [3], g], [5]-Pj. [21], [25]- [27], and [29]. Chaichanavong and 
Siegel [1, Theorem 3] gave a lower bound on the pseudo-weight of a non-zero
pseudo-codeword of an LDPC code. Xia and Fu [29] showed that the Chaichanavong-Siegel
bound is tight if and only if the pseudo-codeword is a real multiple of a codeword.
Using this result they further obtained that for some LDPC codes, e.g., Euclidean
plane and projective plane LDPC codes [15], there are no minimum pseudo-codewords
other than the real multiples of minimum weight codewords. Recently, Kelley,
Sridhara, Xu, and Rosenthal [8, Theorem III.1], [10, Theorem 3.1] presented a lower
bound on the pseudo-weight of a non-zero pseudo-codeword of an LDPC code with
girth greater than 4, which includes the Chaichanavong-Siegel bound as a special case.
In [6], Kashyap and Vardy gave a lower bound on the stopping distance of an LDPC
code. In this correspondence, we study the minimum pseudo-weight and minimum
pseudo-codewords of LDPC codes under LP decoding, and obtain the results summarized
in the abstract. The rest of this correspondence is organized as follows. In
Section II, we briefly review LP decoding and pseudo-codewords of binary linear codes.
In Section III, the main results of this correspondence are given and some LDPC codes
are listed to illustrate these results. In Section IV, the proofs of the main results are
given. In Section V we end with some concluding remarks.

II Preliminaries 

Let $C$ be a binary $[n, k, d]$ linear code with length $n$, dimension $k$, and minimum
distance $d$. The codewords with (Hamming) weight $d$ are called minimum codewords
of $C$. Let $A_i$ be the number of codewords of weight $i$. Let $H$ be an $m \times n$
parity-check matrix of $C$, where the rows of $H$ may be dependent. Let $I = \{1, 2, \ldots, n\}$ and
$J = \{1, 2, \ldots, m\}$ denote the sets of column indices and row indices of $H$, respectively.
The Tanner graph $G_H$ corresponding to $H$ is a bipartite graph comprising $n$ variable
nodes labelled by the elements of $I$, $m$ check nodes labelled by the elements of $J$, and
the edge set $E \subseteq \{(i,j) : i \in I, j \in J\}$, where there is an edge $(i,j) \in E$ if and only if
$h_{ji} = 1$. The girth $g$ of $G_H$, or briefly the girth of $H$, is defined as the minimum length
of a cycle in $G_H$. Note that the girth $g$ must be an even integer not smaller than 4.

Definition 1 A stopping set $S$ is a subset of $I$ such that the restriction of $H$ to $S$, i.e.,
the $m \times |S|$ sub-matrix of $H$ consisting of the columns indexed by $S$, does not contain
a row of weight one. The smallest size of a nonempty stopping set, denoted by $s(H)$,
is called the stopping distance of $C$. A stopping set with size $s(H)$ is called a smallest
stopping set. The number of smallest stopping sets is denoted by $T_s(H)$.

In other words, a stopping set $S$ is a subset of variable nodes in $G_H$ such that every
neighbor of $S$ is connected to $S$ at least twice. For more results on stopping sets
and stopping distance we refer the readers to [2], [6], [19], [20], and [30].
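
Definition 1 can be checked mechanically on small matrices. The following is a minimal sketch, assuming $H$ is given as a list of 0/1 rows and columns are indexed from 0; the matrix `H_toy` is a hypothetical example that does not come from the text, and the exhaustive search is only practical for small $n$.

```python
from itertools import combinations

def is_stopping_set(H, S):
    """Return True if no row of H, restricted to the columns in S, has weight one."""
    return all(sum(row[i] for i in S) != 1 for row in H)

def stopping_distance(H):
    """Size of a smallest non-empty stopping set (exhaustive search, small n only)."""
    n = len(H[0])
    for size in range(1, n + 1):
        for S in combinations(range(n), size):
            if is_stopping_set(H, S):
                return size
    return None  # no non-empty stopping set exists

# Hypothetical 3 x 4 parity-check matrix, not taken from the text.
H_toy = [
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 1, 1],
]

print(is_stopping_set(H_toy, (0, 1, 2)))  # True: every row meets {0,1,2} twice
print(stopping_distance(H_toy))           # 3
```

The empty set passes the test trivially, which is why the stopping distance is defined over nonempty sets only.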

Suppose a codeword $c$ is transmitted over a binary-input memoryless channel and
$y$ is the output of the channel. The log-likelihood ratio vector is defined by
$\lambda = (\lambda_1, \lambda_2, \ldots, \lambda_n)$, where
$\lambda_i = \ln \frac{\Pr\{y_i \mid c_i = 0\}}{\Pr\{y_i \mid c_i = 1\}}$. Let $\mathrm{conv}(C)$ be the convex hull of $C$ in
the real space $\mathbb{R}^n$. Maximum-likelihood (ML) decoding is equivalent to the following
optimization problem [3], [1]: Find $x \in \mathrm{conv}(C)$ that minimizes $\lambda x^T$. To decrease the
decoding complexity, the region $\mathrm{conv}(C)$ should be relaxed. For each row $h_j$ of $H$,
$1 \le j \le m$, let $C_j = \{c \in \{0,1\}^n : h_j c^T = 0 \bmod 2\}$. The fundamental polytope of $C$ is
defined as $P(H) = \bigcap_{j=1}^{m} \mathrm{conv}(C_j)$. LP decoding then solves the following optimization
problem [3], [1]: Find $x \in P(H)$ that minimizes $\lambda x^T$. Note that $\mathrm{conv}(C) \subseteq P(H)$.






However, usually $\mathrm{conv}(C) \subsetneq P(H)$, which implies that the LP decoder is a sub-optimal
decoder. The support of a real vector $x \in \mathbb{R}^n$, denoted by $\mathrm{supp}(x)$, is defined as the
set of positions of non-zero coordinates in $x$, i.e., $\mathrm{supp}(x) = \{i : x_i \ne 0\}$. Assuming
that the channel is a binary-input output-symmetric channel, and given that the code
$C$ is linear, we can without loss of generality assume that the all-zeros codeword was
transmitted. When analyzing the LP decoder for $C$ it is then sufficient to understand
the fundamental cone $K(H)$ of $H$, which is the conic hull of the fundamental polytope
$P(H)$. The fundamental cone $K(H)$ can be characterized as follows [3], [4], [12]: it is the
set of vectors $x = (x_1, \ldots, x_n) \in \mathbb{R}^n$ such that $x_i \ge 0$, $i = 1, \ldots, n$, and

\[ \forall\, 1 \le j \le m,\ \forall\, i \in \mathrm{supp}(h_j): \quad \sum_{l \in \mathrm{supp}(h_j) \setminus \{i\}} x_l \ge x_i. \tag{1} \]

The elements of $K(H)$ are called pseudo-codewords of $C$. Hence, the question of
whether the LP decoder succeeds is equivalent to whether the following optimization
problem has the zero vector as its optimal solution: Find $x \in K(H)$ that minimizes
$\sum_{i=1}^{n} x_i \lambda_i$. Two pseudo-codewords $x, y$ are said to be equivalent if there exists a real
number $\alpha > 0$ such that $y = \alpha x$. Clearly, $x \in K(H) \iff \alpha x \in K(H)$. For any
$x \in K(H)$, let $[x] = \{\alpha x : \alpha > 0\}$.
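
The inequalities in (1) translate directly into a membership test for the fundamental cone. The sketch below is illustrative only: the function name, the tolerance `tol`, and the matrix `H_toy` are assumptions introduced here, not part of the original text.

```python
def in_fundamental_cone(H, x, tol=1e-12):
    """Check x >= 0 and inequality (1) for every row of H and every i in its support."""
    if any(v < -tol for v in x):
        return False
    for row in H:
        support = [i for i, h in enumerate(row) if h == 1]
        total = sum(x[i] for i in support)
        # (1): the other coordinates on supp(h_j) must sum to at least x_i,
        # i.e. total - x_i >= x_i for every i in the support.
        if any(total - x[i] < x[i] - tol for i in support):
            return False
    return True

# Hypothetical 3 x 4 parity-check matrix, not taken from the text.
H_toy = [
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 1, 1],
]

print(in_fundamental_cone(H_toy, [1, 1, 1, 0]))        # True: a codeword
print(in_fundamental_cone(H_toy, [0.5, 0.5, 0.5, 0]))  # True: K(H) is closed under positive scaling
print(in_fundamental_cone(H_toy, [1, 0, 0, 0]))        # False: the first row violates (1)
```

The second call illustrates the equivalence classes $[x]$: scaling by a positive constant never changes membership.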

Definition 2 A pseudo-codeword $x$ is said to be internal if there exist a real number
$\beta$, $0 < \beta < 1$, and $x^{(1)}, x^{(2)} \in K(H) \setminus [x]$ such that $x = \beta x^{(1)} + (1 - \beta) x^{(2)}$. If a nonzero
pseudo-codeword $x$ is not internal, $[x]$ is called an edge of $K(H)$. Let $M(H)$ denote the
set of all edges of $K(H)$. The pseudo-codewords on edges in $M(H)$ are called minimal
pseudo-codewords.

It is known from [27] and linear programming theory [1], [18] that the behavior of the
LP decoder is completely characterized by $M(H)$, and $|M(H)|$ must be finite for fixed
$C$ and $H$. From now on, we only consider the binary-input additive white Gaussian
noise (AWGN) channel.

Definition 3 The (AWGN) pseudo-weight of a non-zero real vector $x \in \mathbb{R}^n$ is defined
by $w_P(x) = \|x\|_1^2 / \|x\|_2^2$, where $\|x\|_1 = |x_1| + \cdots + |x_n|$ and $\|x\|_2 = \sqrt{x_1^2 + \cdots + x_n^2}$.
Denote by $d_P(H)$ the minimum pseudo-weight of non-zero pseudo-codewords of $C$. The
pseudo-codewords with pseudo-weight $d_P(H)$ are called minimum pseudo-codewords.
Define the pseudo-weight of an edge $[x] \in M(H)$ as the pseudo-weight of $x$. The edges
with minimum pseudo-weight are called minimum edges. The number of minimum
edges is denoted by $B_P(H)$.
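
As a small numerical illustration of Definition 3, the sketch below computes the AWGN pseudo-weight of a real vector. The function name is an assumption made here; the two test vectors are the representatives that appear in Example 1.

```python
def pseudo_weight(x):
    """AWGN pseudo-weight ||x||_1^2 / ||x||_2^2 of a non-zero real vector."""
    l1 = sum(abs(v) for v in x)
    l2_squared = sum(v * v for v in x)
    if l2_squared == 0:
        raise ValueError("the pseudo-weight is undefined for the zero vector")
    return l1 * l1 / l2_squared

print(pseudo_weight([1, 0, 1, 1, 1, 0, 0]))        # 4.0, the Hamming weight of a binary vector
print(pseudo_weight([0.5, 0, 0.5, 0.5, 0.5, 0, 0]))  # 4.0, unchanged by positive scaling
print(pseudo_weight([1, 2, 1, 1, 1, 2, 2]))        # 6.25 = 10^2 / 16
```

For a binary vector both norms reduce to counts of ones, so the pseudo-weight equals the Hamming weight, and the scale invariance is what makes the pseudo-weight of an edge $[x]$ well defined.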






It is not difficult to see from linear programming theory [18] that minimum
pseudo-codewords are also minimal pseudo-codewords. Note that the minimal pseudo-codewords
in the same edge have the same pseudo-weight.

Just as $d$ and $A_d$ of a linear code are important for characterizing the performance
of ML decoding, $d_P(H)$ and $B_P(H)$ are crucial for characterizing the performance of
LP decoding. In order to obtain better performance, we should try to find a desirable
parity-check matrix $H$ that maximizes $d_P(H)$ and then minimizes $B_P(H)$. Since the
support of every codeword is a stopping set [20] and every stopping set supports a
pseudo-codeword [12], it is known that $d_P(H) \le s(H) \le d$ regardless of the choice of
$H$, and $B_P(H) \ge T_s(H) \ge A_d$ for any $H$ such that $d_P(H) = s(H) = d$. It is well
known that LP decoding is asymptotically optimal, in the sense that the ratio of the
probabilities of decoding errors of LP decoding and ML decoding approaches 1 as the
SNR tends to infinity, if and only if $d_P(H) = d$ and $B_P(H) = A_d$.

Next, we give an example to illustrate the above concepts. 

Example 1 Let $C$ be a binary $[7, 3, 4]$ cyclic simplex code. The parity-check matrix
$H$ of $C$ is formed by a $7 \times 7$ circulant matrix, where the first row is $(1\ 1\ 0\ 1\ 0\ 0\ 0)$.
$H$ has uniform column weight 3 and girth 6. The seven non-zero codewords of $C$
are $(1,0,1,1,1,0,0)$ and its cyclic shifts, each of which is a minimum codeword. The
smallest stopping sets are the seven sets $\{1,3,4,5\}$, $\{2,4,5,6\}$, $\{3,5,6,7\}$, $\{1,4,6,7\}$,
$\{1,2,5,7\}$, $\{1,2,3,6\}$, $\{2,3,4,7\}$. Every other non-empty stopping set is a union of
these, namely one of the seven sets of size 6 or $\{1,2,3,4,5,6,7\}$. We choose one minimal
pseudo-codeword as a representative from each edge in $M(H)$. Then all 14 representatives
are $(1,0,1,1,1,0,0)$ and its cyclic shifts, and $(1,2,1,1,1,2,2)$ and its cyclic shifts, where
only the first 7 are minimum pseudo-codewords. Thus, there are 14 edges in $M(H)$,
7 of which are minimum edges. Clearly, $C$ satisfies $d_P(H) = d = 4$ and
$B_P(H) = A_d = 7$, which implies that LP decoding is asymptotically optimal for $C$.
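
The codeword and stopping-set counts in Example 1 are small enough to confirm by exhaustive search. The sketch below is only a check under one assumption: the first row of the circulant is taken to be $(1,1,0,1,0,0,0)$, which is consistent with column weight 3 and with the listed codewords; all function and variable names are introduced here for illustration.

```python
from itertools import combinations, product

first_row = (1, 1, 0, 1, 0, 0, 0)  # assumed first row of the 7 x 7 circulant
n = len(first_row)
# Row r of the circulant is the first row cyclically shifted right by r positions.
H = [tuple(first_row[(i - r) % n] for i in range(n)) for r in range(n)]

def is_codeword(c):
    """True if every parity check of H is satisfied over GF(2)."""
    return all(sum(h * v for h, v in zip(row, c)) % 2 == 0 for row in H)

def is_stopping_set(S):
    """True if no row of H has exactly one 1 on the (0-indexed) positions in S."""
    return all(sum(row[i] for i in S) != 1 for row in H)

codewords = [c for c in product((0, 1), repeat=n) if is_codeword(c)]
weights = sorted(sum(c) for c in codewords if any(c))
d = weights[0]             # minimum distance
A_d = weights.count(d)     # number of minimum codewords

# Stopping sets are reported with 1-indexed positions, as in the text.
stopping_sets = [S for size in range(1, n + 1)
                 for S in combinations(range(1, n + 1), size)
                 if is_stopping_set([i - 1 for i in S])]
s = min(len(S) for S in stopping_sets)
smallest = [S for S in stopping_sets if len(S) == s]

print(len(codewords), d, A_d)  # 8 4 7
print(s, len(smallest))        # 4 7
print(smallest)
```

The search confirms $s(H) = d = 4$ and $T_s(H) = A_d = 7$ for this matrix; it does not enumerate the edges of $K(H)$, so the value $B_P(H) = 7$ is taken from the text rather than recomputed.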

III Main Results

Let $C$ be a binary $[n, k, d]$ linear code with parity-check matrix $H$. If the Tanner graph
$G_H$ has girth $g \ge 6$ and $H$ has uniform column weight $\gamma$, Tanner [22] showed that
the minimum distance $d$ fulfills $d \ge d_L$, where

\[ d_L = \begin{cases} 1 + \gamma + \sum_{i=1}^{(g-6)/4} \gamma(\gamma-1)^i, & g/2 \text{ odd}, \\ 1 + \gamma + \sum_{i=1}^{(g-8)/4} \gamma(\gamma-1)^i + (\gamma-1)^{(g-4)/4}, & g/2 \text{ even}. \end{cases} \tag{2} \]

Orlitsky et al. [19] further obtained that $d_L$ is still a lower bound on the stopping
distance, i.e., $s(H) \ge d_L$. Recently, Kelley, Sridhara, Xu, and Rosenthal [8], [10] proved
that the minimum pseudo-weight satisfies $d_P(H) \ge d_L$, and the bound still holds
when $H$ has non-uniform column weight with minimum column weight $\gamma$. In the next
theorem, which will be proved in Section IV, we give a necessary and sufficient condition
for $w_P(x) = d_L$ to hold for a non-zero pseudo-codeword $x \in K(H)$.

Theorem 1 Let $C$ be a binary linear code with length $n$. Let $H$ be a parity-check matrix
of $C$ with girth $g \ge 6$ and minimum column weight $\gamma$. Let $x = (x_1, \ldots, x_n) \in K(H)$ be
a non-zero pseudo-codeword and $d_L$ be defined in (2). Then $w_P(x) = d_L$ if and only if
$d_L x / \sum_{i=1}^{n} x_i$ is a codeword of $C$ with weight $d_L$.

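It is easy to check that

\[ d_L = \begin{cases} \beta^{(g-2)/4} + \dfrac{2(\beta^{(g-2)/4} - 1)}{\beta - 1}, & g/2 \text{ odd}, \\[2mm] \dfrac{2(\beta^{g/4} - 1)}{\beta - 1}, & g/2 \text{ even}, \end{cases} \tag{3} \]

where $\beta = \gamma - 1$. In particular,

\[ d_L = \begin{cases} \beta + 2, & g = 6, \\ 2(\beta + 1), & g = 8, \\ \beta^2 + 2\beta + 2, & g = 10, \\ 2(\beta^2 + \beta + 1), & g = 12, \\ \beta^3 + 2\beta^2 + 2\beta + 2, & g = 14, \\ 2(\beta^3 + \beta^2 + \beta + 1), & g = 16. \end{cases} \tag{4} \]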
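
The special values in (4) can be cross-checked against the defining sums in (2). The following is a minimal sketch: the function name `tanner_bound` and the dictionary of closed forms are introduced here for illustration, and the check is purely arithmetic.

```python
def tanner_bound(gamma, g):
    """d_L from (2) for column weight gamma and even girth g >= 6."""
    if g < 6 or g % 2:
        raise ValueError("girth must be an even integer >= 6")
    beta = gamma - 1
    if (g // 2) % 2 == 1:                # g/2 odd, g = 4t + 6
        t = (g - 6) // 4
        return 1 + gamma + sum(gamma * beta**i for i in range(1, t + 1))
    t = (g - 8) // 4                     # g/2 even, g = 4t + 8
    return (1 + gamma + sum(gamma * beta**i for i in range(1, t + 1))
            + beta ** ((g - 4) // 4))

# Special values as listed in (4), as functions of beta = gamma - 1.
special_values = {
    6: lambda b: b + 2,
    8: lambda b: 2 * (b + 1),
    10: lambda b: b**2 + 2 * b + 2,
    12: lambda b: 2 * (b**2 + b + 1),
    14: lambda b: b**3 + 2 * b**2 + 2 * b + 2,
    16: lambda b: 2 * (b**3 + b**2 + b + 1),
}

for gamma in range(2, 8):
    for g, closed in special_values.items():
        assert tanner_bound(gamma, g) == closed(gamma - 1)

print(tanner_bound(3, 6))  # 4, the value used for Example 1
```

Working with the sums in (2) rather than a closed form avoids any division by $\beta - 1$, so the check also covers $\gamma = 2$.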

Clearly, for the codes satisfying the conditions of Theorem 1, $d = d_L$ will imply
$d_P(H) = s(H) = d$. Furthermore, by Theorem 1 we have the next corollary.

Corollary 2 Let $C$ be a binary $[n, k, d]$ linear code with length $n$, dimension $k$ and
minimum distance $d$. Let $H$ be a parity-check matrix of $C$ with girth $g$ ($g \ge 6$) and
minimum column weight $\gamma$. If $d = d_L$, where $d_L$ is defined in (2), then
$B_P(H) = T_s(H) = A_d$, where $A_d$ is the number of minimum codewords, $T_s(H)$ is the number of
smallest stopping sets, and $B_P(H)$ is the number of minimum edges.
Remark 1 For a code $C$ satisfying the conditions of Corollary 2, the minimum
codewords, the nonempty stopping sets of smallest size and the minimum edges are all
equivalent, which implies that LP decoding is asymptotically optimal for $C$.

In Example 1, $\gamma = 3$, $g = 6$, $d = 4$, and $d_L = 1 + \gamma = 4 = d$. Hence, $C$ satisfies the
conditions of Theorem 1 and Corollary 2, and $B_P(H) = T_s(H) = A_d = 7$.

Note that [29, Theorems 1 and 2] are the special case $g = 6$ of Theorem 1 and
Corollary 2, respectively. In [29], it is shown that two classes of finite geometry LDPC
codes, i.e., the projective plane LDPC codes and Euclidean plane LDPC codes [15],
meet the conditions of Corollary 2. Thus, LP decoding is asymptotically optimal for
finite plane LDPC codes. Below we give some more examples of LDPC codes satisfying
the conditions of Corollary 2.

Example 2 A class of regular LDPC codes called $LU(3, q)$ codes was constructed in
[7], where $q$ is a prime power. $LU(3, q)$ codes have the following parameters, where $n$
is the code length, $d$ is the minimum distance, $m$ is the number of rows of the
parity-check matrix, $\rho$ is the uniform row weight of the parity-check matrix, $\gamma$ is the uniform
column weight of the parity-check matrix, and $g$ is the girth of the Tanner graph.

$n = q^3$, $m = q^3$, $d = 2q$, $\rho = q$, $\gamma = q$, $g = 8$.

This class of LDPC codes meets the conditions of Corollary 2. Thus, LP decoding is
asymptotically optimal for $LU(3, q)$ codes.

Example 3 In [17], [24], regular LDPC codes were constructed from generalized
polygons. In [17, Table 1], for a prime power $q$, LDPC codes $W(q)$, $H(3, q^2)$, $H(q)$, $T(q^3, q)$,
and $O(q)$ have the following parameters, where $n$ is the code length, $d$ is the minimum
distance, $\gamma$ is the uniform column weight of the parity-check matrix, and $g$ is the girth
of the Tanner graph.

(i) $W(q)$: $n = (q + 1)(q^2 + 1)$, $d = 2(q + 1)$, $\gamma = q + 1$, $g = 8$;

(ii) $H(3, q^2)$: $n = (q^2 + 1)(q^3 + 1)$, $d = 2(q + 1)$, $\gamma = q + 1$, $g = 8$;

(iii) $H(q)$: $n = (q + 1)(q^4 + q^2 + 1)$, $d = 2(q^2 + q + 1)$, $\gamma = q + 1$, $g = 12$;

(iv) $T(q^3, q)$: $n = (q^3 + 1)(q^8 + q^4 + 1)$, $d = 2(q^2 + q + 1)$, $\gamma = q + 1$, $g = 12$;

(v) $O(q)$, $q = 2^{2e+1}$: $n = (q^2 + 1)(q^3 + 1)(q^6 + 1)$, $d = 2(q^3 + q^2 + q + 1)$, $\gamma = q + 1$,
$g = 16$.

By (4), it is obvious that these LDPC codes meet the conditions of Corollary 2. Thus,
LP decoding is asymptotically optimal for them.
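
The claim $d = d_L$ for the families in Examples 2 and 3 is a matter of arithmetic, which the sketch below replays. It assumes the $(d, \gamma, g)$ triples as read from the two examples and evaluates $d_L$ through geometric sums in $\beta = \gamma - 1$, which is one way of writing the closed form (3) without dividing by $\beta - 1$; the function names are hypothetical.

```python
def d_L_closed(beta, g):
    """d_L as a geometric sum in beta = gamma - 1, for even girth g >= 6."""
    if g < 6 or g % 2:
        raise ValueError("girth must be an even integer >= 6")
    if (g // 2) % 2 == 1:                             # g/2 odd
        e = (g - 2) // 4
        return beta**e + 2 * sum(beta**i for i in range(e))
    return 2 * sum(beta**i for i in range(g // 4))    # g/2 even

def example_families(q):
    """(d, gamma, g) for each family, as read from Examples 2 and 3."""
    return {
        "LU(3,q)":  (2 * q, q, 8),
        "W(q)":     (2 * (q + 1), q + 1, 8),
        "H(3,q^2)": (2 * (q + 1), q + 1, 8),
        "H(q)":     (2 * (q * q + q + 1), q + 1, 12),
        "T(q^3,q)": (2 * (q * q + q + 1), q + 1, 12),
        "O(q)":     (2 * (q**3 + q * q + q + 1), q + 1, 16),
    }

def all_meet_bound(q):
    """True if d == d_L for every listed family at this value of q."""
    return all(d == d_L_closed(gamma - 1, g)
               for d, gamma, g in example_families(q).values())

print(all(all_meet_bound(q) for q in (2, 3, 4, 5, 7, 8)))  # True
```

The identity holds for every integer $q$, so the check does not depend on the restriction $q = 2^{2e+1}$ that the construction of $O(q)$ requires.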

Example 4 In [21], some LDPC codes with $d_P(H) = d$ were constructed by enumerating
a regular tree for a fixed number $\ell$ of layers and employing a connection algorithm
based on mutually orthogonal Latin squares to close the tree.

(i) Type-I A construction [21]: It is known that if $\ell$ or $g/2$ is odd, then $d = d_L$. Hence,
these LDPC codes with odd $g/2$ meet the conditions of Corollary 2 and LP decoding
is asymptotically optimal for them.

(ii) Type-II construction [21]: For the binary case and $\ell = 3$, the Type-II construction
yields exactly the projective plane LDPC codes [23], [29]. For the binary case and $\ell = 4$,
it is conjectured in [21] that $d = d_L$. Clearly, if this conjecture is true, then these
LDPC codes meet the conditions of Corollary 2 and LP decoding is asymptotically
optimal for them. In particular, it is known from [2T] and [21] that this is true for the
(2, 2)-Finite-Generalized-Quadrangles-based LDPC codes.

The next theorem shows that the lower bound of Kashyap and Vardy [6] on the
stopping distance of an LDPC code is also a lower bound on the pseudo-weight of a
non-zero pseudo-codeword of this LDPC code, and this lower bound is tight if and only
if this pseudo-codeword is a real multiple of a codeword. The proof of this theorem
will be given in Section IV.

Theorem 3 Let $C$ be a binary linear code with length $n$. Let $H$ be an $m \times n$
parity-check matrix of $C$ with minimum column weight $\gamma$. If any two distinct columns of
$H$ have at most $\lambda$ common 1's and $\gamma/\lambda$ is an integer, then $w_P(x) \ge \gamma/\lambda + 1$ for any
non-zero pseudo-codeword $x = (x_1, \ldots, x_n) \in K(H)$. Moreover, equality holds if and
only if $(\gamma/\lambda + 1)\, x / \sum_{i=1}^{n} x_i$ is a codeword of $C$ with weight $\gamma/\lambda + 1$.

Remark 2 If $H$ has uniform column weight $\gamma$, Kashyap and Vardy [6, Theorem 1]
showed that the stopping distance satisfies $s(H) \ge \gamma/\lambda + 1$. Since $d_P(H) \le s(H)$,
Theorem 3 implies the Kashyap-Vardy lower bound on the stopping distance.

Clearly, for codes satisfying the conditions of Theorem 3, $d = \gamma/\lambda + 1$ will imply
$d_P(H) = s(H) = d$. Furthermore, by Theorem 3, we obtain the next corollary.
Corollary 4 Let $C$ be a binary $[n, k, d]$ linear code with length $n$, dimension $k$ and
minimum distance $d$. Let $H$ be a parity-check matrix of $C$ with minimum column
weight $\gamma$. If any two distinct columns of $H$ have at most $\lambda$ common 1's and $\gamma/\lambda$ is
an integer, and if $d = \gamma/\lambda + 1$, then $B_P(H) = T_s(H) = A_d$, where $A_d$ is the number of
minimum codewords, $T_s(H)$ is the number of smallest stopping sets, and $B_P(H)$ is the
number of minimum edges.

Remark 3 In Theorem 3 and Corollary 4, the girth of the Tanner graph $G_H$ is at
least 6 if $\lambda = 1$ and 4 if $\lambda > 1$. The special cases $\lambda = 1$ of Theorem 3 and Corollary 4
are exactly [29, Theorems 1 and 2] and the special case $g = 6$ of Theorem 1 and
Corollary 2.

Example 5 Consider the binary $[2^r - 1, 2^r - r - 1, 3]$ Hamming code. Let $H$ be a
parity-check matrix which consists of all the non-zero codewords of the binary $[2^r - 1, r, 2^{r-1}]$
simplex code. It is easy to see that $\gamma = 2^{r-1}$ and $\lambda = 2^{r-2}$. Hence, by Theorem 3 and
Corollary 4, $d_P(H) = 3$ and any minimum pseudo-codeword must be a real multiple
of some minimum codeword.
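
The values $\gamma = 2^{r-1}$ and $\lambda = 2^{r-2}$ in Example 5 can be confirmed for small $r$ by building the matrix explicitly. The sketch below is illustrative: it assumes the columns are labelled by the non-zero binary $r$-tuples and the rows by the non-zero messages of the simplex code, and the helper names are introduced here.

```python
from itertools import combinations, product

def simplex_parity_check(r):
    """Rows are all non-zero codewords of the [2^r - 1, r] simplex code."""
    points = [p for p in product((0, 1), repeat=r) if any(p)]  # column labels
    # Entry (msg, p) is the inner product <msg, p> over GF(2).
    return [[sum(a * b for a, b in zip(msg, p)) % 2 for p in points]
            for msg in points]

def column_stats(H):
    """Minimum column weight and maximum overlap between two distinct columns."""
    cols = list(zip(*H))
    gamma = min(sum(col) for col in cols)
    lam = max(sum(a & b for a, b in zip(c1, c2))
              for c1, c2 in combinations(cols, 2))
    return gamma, lam

for r in (3, 4):
    gamma, lam = column_stats(simplex_parity_check(r))
    print(r, gamma, lam, gamma // lam + 1)
# prints: 3 4 2 3
#         4 8 4 3
```

In both cases $\gamma/\lambda + 1 = 3$, which matches the minimum distance of the Hamming code, so the hypothesis $d = \gamma/\lambda + 1$ of Corollary 4 is met.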

Let $q = 2^s$ and let $EG(m, q)$ be the $m$-dimensional Euclidean geometry over $GF(q)$. It
is known from [16] and [23] that there are $q^m$ points and $q(q^m - 1)/(q - 1)$ hyperplanes
in $EG(m, q)$. By removing a point of $EG(m, q)$ together with the $(q^m - 1)/(q - 1)$
hyperplanes containing this point, we obtain a slightly modified incidence matrix $H$ of
points and hyperplanes in $EG(m, q)$. Suppose the rows of $H$ indicate the hyperplanes.
The point-hyperplane Euclidean geometry LDPC code $C$ with the parity-check matrix
$H$ has the following parameters: length $n = q^m - 1$, uniform column weight of $H$
$\gamma = q^{m-1}$, uniform row weight $\rho = q^{m-1}$, girth of Tanner graph $g = 4$ if $m > 2$. It is
easy to see that any two distinct columns of $H$ have at most $\lambda = q^{m-2}$ common 1's.
By Theorem 3, we have that $d \ge s(H) \ge d_P \ge \gamma/\lambda + 1 = q + 1$. From [16] and [25],
we know that $H$ can be put in cyclic form and the generator polynomial $g(x)$ can be
determined. By [16, p. 315, (8.33)], it is known that the dimension is $k = 2^{sm} - (m + 1)^s$.
The following examples show that $d = q + 1$ in some cases.
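
The parameter formulas above are easy to tabulate. The sketch below simply evaluates them for given $m$ and $s$; the function name and the returned dictionary layout are assumptions made for illustration, and no geometry is constructed.

```python
def eg_parameters(m, s):
    """Parameters of the point-hyperplane EG code for q = 2^s and dimension m >= 2."""
    q = 2 ** s
    hyperplanes = q * (q**m - 1) // (q - 1)
    removed = (q**m - 1) // (q - 1)        # hyperplanes through the deleted point
    gamma = q ** (m - 1)                   # uniform column weight
    lam = q ** (m - 2)                     # maximum overlap of two columns
    return {
        "q": q,
        "n": q**m - 1,
        "k": 2 ** (s * m) - (m + 1) ** s,
        "rows": hyperplanes - removed,
        "gamma": gamma,
        "lambda": lam,
        "bound": gamma // lam + 1,         # gamma / lambda + 1 = q + 1
    }

print(eg_parameters(3, 2))  # n = 63, k = 48, bound = 5
print(eg_parameters(3, 3))  # n = 511, k = 448, bound = 9
```

For $m = 3$ these values reproduce the $[63, 48]$ and $[511, 448]$ codes of the two examples that follow, with lower bounds 5 and 9 on $d_P$.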

Example 6 Let $m = 3$ and $s = 2$. Then $C$ is a binary $[63, 48]$ code with generator
polynomial $g(x) = 1 + x^2 + x^4 + x^{11} + x^{13} + x^{14} + x^{15}$ [16, pp. 310-311]. By Theorem 3,
we know that $d \ge s(H) \ge d_P \ge q + 1 = 5$. In fact, it is easy to calculate by computer
that $d$ does equal 5. For example, $1 + x^{23} + x^{33} + x^{36} + x^{37}$ is a weight-5 codeword.
Hence, by Corollary 4, we have that $A_d = T_s(H) = B_P(H)$.

Example 7 Let $m = 3$ and $s = 3$. Then $C$ is a binary $[511, 448]$ code [23, Example 1].
The girth of the Tanner graph is 4 and $C$ performs very well under iterative decoding
[25]. By Theorem 3, we know that $d \ge s(H) \ge d_P \ge q + 1 = 9$. In fact, it can
be calculated by the method in [5] that $d = 9$. Hence, by Corollary 4, we have that
$A_d = T_s(H) = B_P(H)$ and LP decoding for $C$ is asymptotically optimal.



IV Proofs of Theorems 1 and 3

In this section, we prove Theorems 1 and 3. Chaichanavong and Siegel [1, Proposition
2] gave a lower bound on the pseudo-weight of a real vector. In [29], the necessary and
sufficient condition for this bound being tight is discussed. Let $u$ be a positive integer.
Denote by $\mathcal{F}_u$ the set of vectors $y \in [0, 1/u]^n$ such that $\sum_{i=1}^{n} y_i = 1$.

Lemma 1 [29] For any $y \in \mathcal{F}_u$, we have $w_P(y) \ge u$. Equality holds if and only if $y$
has exactly $u$ non-zero components with value $1/u$.
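
For orientation, the inequality in Lemma 1 follows from a one-line estimate. The derivation below is a short sketch written for this purpose and is not claimed to be the argument used in [29]; it only uses $0 \le y_i \le 1/u$ and $\sum_i y_i = 1$.

```latex
\[
  \sum_{i=1}^{n} y_i^2 \;\le\; \frac{1}{u} \sum_{i=1}^{n} y_i \;=\; \frac{1}{u}
  \quad\Longrightarrow\quad
  w_P(y) \;=\; \frac{\bigl(\sum_{i=1}^{n} y_i\bigr)^2}{\sum_{i=1}^{n} y_i^2}
        \;=\; \frac{1}{\sum_{i=1}^{n} y_i^2} \;\ge\; u .
\]
% Equality forces y_i^2 = y_i / u for every i, i.e. y_i \in \{0, 1/u\};
% together with \sum_i y_i = 1 this gives exactly u components equal to 1/u.
```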

A. Proof of Theorem 1

From the proof of [10, Theorem 3.1], we know that for any non-zero $x = (x_1, \ldots, x_n) \in K(H)$,
$d_L x_i \le \sum_{j=1}^{n} x_j$ for any $i \in \mathrm{supp}(x)$. Let $y = x / \sum_{j=1}^{n} x_j$. Then $y \in K(H)$
and $y \in \mathcal{F}_{d_L}$. Hence, by Lemma 1, $w_P(x) = w_P(y) \ge d_L$, where equality holds if
and only if $c = d_L x / \sum_{i=1}^{n} x_i$ is a binary vector with weight $d_L$. Now, we show that
$w_P(x) = d_L$ if and only if $c \in C$ and $w_H(c) = d_L$. If $c \in C$ and $w_H(c) = d_L$,
then $w_P(x) = w_P(c) = w_H(c) = d_L$. On the other hand, if $w_P(x) = d_L$, then the
pseudo-codeword $c$ is a binary vector with weight $d_L$. Next, we show that $c \in C$.

Clearly, $S = \mathrm{supp}(c)$ is a stopping set with size $d_L$ since $c$ is a pseudo-codeword.
For any fixed $i \in S$, we construct a local tree of $i$ (see Figure 1) as in the proof of
[10, Theorem 3.1]. For the sake of convenience, we briefly describe the construction
procedure as follows. Below we use $f, e$ to denote check nodes and $i, j$ to denote variable
nodes of the Tanner graph $G_H$. Let $t = \lfloor (g - 6)/4 \rfloor \ge 0$, where $\lfloor x \rfloor$ is the floor function,
which denotes the maximum integer not greater than $x$. Then $g = 4t + 6$ for odd $g/2$
and $g = 4t + 8$ for even $g/2$. In the local tree of $i$, $i$ is the root of the tree. A check
node $f$ connected to $i$ is called a child of $i$, a variable node $j$ connected to $f$ other than
its parent $i$ is called a child of $f$ or a grandchild of $i$, a check node $e$ connected
to $j$ other than its parent $f$ is called a child of $j$, and so on. For a variable node $j$ in the
local tree, let $\mathrm{child}(j)$ and $\mathrm{grch}(j)$ denote the sets of all children and grandchildren of
$j$, respectively. Note that

\[ \mathrm{grch}(j) = \bigcup_{f \in \mathrm{child}(j)} \mathrm{child}(f). \tag{5} \]

[Figure 1: Local tree structure for a ($\ge \gamma$)-left graph]

All nodes in $L_0(i) = \mathrm{grch}(i)$ are called Level-0 variable nodes. For $m = 1, 2, \ldots, t$, all
nodes in

\[ L_m(i) = \bigcup_{j \in L_{m-1}(i)} \mathrm{grch}(j) \tag{6} \]

are called Level-$m$ variable nodes. Fixing a check node $f^* \in \mathrm{child}(i)$, denote
$N_0(f^*) = \mathrm{child}(f^*)$ and

\[ N_m(f^*) = \bigcup_{j \in N_{m-1}(f^*)} \mathrm{grch}(j), \quad m = 1, 2, \ldots, t + 1. \tag{7} \]

The local tree of $i$ has $t$ levels if $g = 4t + 6$ and $t + 1$ levels if $g = 4t + 8$, where $N_{t+1}(f^*)$
is the set of $(t+1)$-th level nodes. Since the Tanner graph $G_H$ has girth $g \ge 6$, the
local tree of $i$ has the following pairwise disjoint properties: (i) all $\mathrm{child}(f)$ in the union
of (5) are pairwise disjoint, and all $\mathrm{grch}(j)$ in the union of (6) are pairwise disjoint; (ii)
if $g = 4t + 6$, $\{i\}, L_0(i), \ldots, L_t(i)$ are pairwise disjoint; (iii) if $g = 4t + 8$, all $\mathrm{grch}(j)$ in
the union of (7) are pairwise disjoint, and $\{i\}, L_0(i), \ldots, L_t(i), N_{t+1}(f^*)$ are pairwise
disjoint.

Since there are at least $\gamma$ 1's in every column of $H$, from the construction we have
that

\[ |\mathrm{child}(i)| \ge \gamma \quad \text{and} \quad |\mathrm{child}(j)| \ge \gamma - 1 \tag{8} \]

for any intermediate variable node $j$ in the local tree of $i$. Let $j$ be a variable node
which has some children in the local tree of $i$. Suppose $j \in S$. For each check node
$f \in \mathrm{child}(j)$, $|\mathrm{child}(f) \cap S| \ge 1$ since $S$ is a stopping set including $j$. Thus, noting that
$i \in S$, by (5), (8) and the pairwise disjoint properties, we have

\[ |L_0(i) \cap S| = |\mathrm{grch}(i) \cap S| \ge \gamma; \quad |\mathrm{grch}(j) \cap S| \ge \gamma - 1 \ \text{ if } j \in S \setminus \{i\}. \tag{9} \]

Moreover, a necessary condition for equality in $|L_0(i) \cap S| = \gamma$ is that for each
$f \in \mathrm{child}(i)$, $|\mathrm{child}(f) \cap S| = 1$. In other words, for any row $h$ of $H$ whose $i$-th component
is 1, $w_H(h_S) = 2$, where $h_S$ is the restriction of $h$ to $S$. Furthermore, by (6), (9) and
the pairwise disjoint properties, for $m = 1, 2, \ldots, t$,

\[ |L_m(i) \cap S| = \Bigl| \bigcup_{j \in L_{m-1}(i)} \mathrm{grch}(j) \cap S \Bigr| \ge \Bigl| \bigcup_{j \in L_{m-1}(i) \cap S} \mathrm{grch}(j) \cap S \Bigr| \ge (\gamma - 1) |L_{m-1}(i) \cap S| \ge \cdots \ge (\gamma - 1)^m |L_0(i) \cap S| \ge (\gamma - 1)^m \gamma, \]

and if $g = 4t + 8$,

\[ |N_{t+1}(f^*) \cap S| = \Bigl| \bigcup_{j \in N_t(f^*)} \mathrm{grch}(j) \cap S \Bigr| \ge (\gamma - 1) |N_t(f^*) \cap S| \ge \cdots \ge (\gamma - 1)^{t+1} |\mathrm{child}(f^*) \cap S| \ge (\gamma - 1)^{t+1}. \]

In other words, $|\{i\} \cap S| = 1$, $|L_0(i) \cap S| \ge \gamma$, $|L_1(i) \cap S| \ge \gamma(\gamma - 1)$, $\ldots$,
$|L_t(i) \cap S| \ge \gamma(\gamma - 1)^t$, and $|N_{t+1}(f^*) \cap S| \ge (\gamma - 1)^{t+1}$ if $g = 4t + 8$. Therefore, we have $|S| \ge d_L$
by adding the above inequalities and using the pairwise disjoint properties, where a
necessary condition for $|S| = d_L$ is that $|L_0(i) \cap S| = \gamma$, that is, $w_H(h_S) = 2$ for any row
$h$ of $H$ whose $i$-th component is 1. This implies that $c$ satisfies all the parity-check
equations corresponding to the rows $h$ of $H$ whose $i$-th component is 1. Thus, when
$i$ varies in $S = \mathrm{supp}(c)$, $c$ must satisfy every parity-check equation in $H$, i.e., $c$ is a
codeword.
B. Proof of Theorem 3

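Let $y = x / \sum_{j=1}^{n} x_j$. Since $x \in K(H)$, we have $y \in K(H)$. For fixed $j$, $1 \le j \le n$,
let $h_{q_1}, h_{q_2}, \ldots, h_{q_\gamma}$ be rows of $H$ whose $j$-th components are 1, i.e.,
$h_{q_i} = (h_{q_i,1}, \ldots, h_{q_i,n})$ and $h_{q_i,j} = 1$ for $1 \le i \le \gamma$. Since $y \in K(H)$,
$y_j \le \sum_{l \ne j} y_l h_{q_i,l}$ for $i = 1, \ldots, \gamma$. Hence,

\[ \gamma y_j \le \sum_{i=1}^{\gamma} \sum_{l \ne j} y_l h_{q_i,l} = \sum_{l \ne j} y_l \Bigl( \sum_{i=1}^{\gamma} h_{q_i,l} \Bigr). \]

For any $l \ne j$, since the $l$-th column and $j$-th column of $H$ have at most $\lambda$ common 1's
and $h_{q_i,j} = 1$ for $1 \le i \le \gamma$, we have that $\sum_{i=1}^{\gamma} h_{q_i,l} \le \lambda$. Therefore,

\[ \gamma y_j \le \lambda \sum_{l \ne j} y_l = \lambda (1 - y_j), \quad \text{i.e.,} \quad y_j \le \frac{\lambda}{\gamma + \lambda} = \frac{1}{\gamma/\lambda + 1}, \]

which implies that $y \in \mathcal{F}_{\gamma/\lambda + 1}$. Hence, by Lemma 1, $w_P(x) = w_P(y) \ge \gamma/\lambda + 1$, and
equality holds if and only if $y$ has exactly $\gamma/\lambda + 1$ non-zero components with value
$\lambda/(\gamma + \lambda)$. In that case, $(\gamma/\lambda + 1) y = (\gamma/\lambda + 1)\, x / \sum_{j=1}^{n} x_j$ must be a binary vector, say $c$,
with weight $\gamma/\lambda + 1$.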

Now, we show that $c$ must be a codeword of $C$. Since $c \in K(H)$, $\mathrm{supp}(c)$ is a
stopping set of $H$, i.e., the restriction of $H$ to $\mathrm{supp}(c)$, say $H(c)$, has no rows of weight
one. Note that any two distinct columns of $H(c)$ have at most $\lambda$ common 1's. Suppose
$b$ is a non-zero row of $H(c)$ and the $j$-th component of $b$ is 1, where $j \in \{1, 2, \ldots, \gamma/\lambda + 1\}$.
Since the $j$-th column of $H(c)$ has at least $\gamma$ 1's, there exists a $\gamma \times (\gamma/\lambda + 1)$ matrix,
say $H(c,j)$, consisting of $b$ and $\gamma - 1$ other rows of $H(c)$ such that the $j$-th column of
$H(c,j)$ is the all-1 column. Since any two distinct columns of $H(c,j)$ have at most $\lambda$
common 1's, each of the columns other than the $j$-th column has at most $\lambda$ 1's. Now
we count the number of 1's in $H(c,j)$, say $A$, in two ways. From the view of columns,
$A \le \gamma + \lambda \cdot (\gamma/\lambda) = 2\gamma$. From the view of rows, since $\mathrm{supp}(c)$ is a stopping set, each row of
$H(c,j)$ has at least two 1's, which implies $A \ge 2\gamma$. Thus, $A = 2\gamma$, and every row of
$H(c,j)$ has exactly two 1's, which implies $w_H(b) = 2$. Hence, the weights of rows of
$H(c)$ are either 0 or 2, which implies that $c$ satisfies every parity-check equation in $H$
and thus is a codeword.
V Conclusions



In this correspondence, we study the minimum pseudo-weight and minimum
pseudo-codewords of LDPC codes. We characterize the pseudo-codewords of an LDPC code
which attain the lower bound $d_L$ of Kelley, Sridhara, Xu, and Rosenthal on the
minimum pseudo-weight. That is, the pseudo-weight of a pseudo-codeword of an LDPC
code is equal to $d_L$ if and only if this pseudo-codeword is a real multiple of a codeword
with weight $d_L$. Furthermore, it is shown that if the minimum distance of this LDPC
code is equal to $d_L$, then the minimum codewords, the nonempty stopping sets of
smallest size and the minimum edges are all equivalent, which implies that LP decoding is
asymptotically optimal for this LDPC code. Then, we show that the lower bound of
Kashyap and Vardy on the stopping distance of an LDPC code is also a lower bound
on the pseudo-weight of a non-zero pseudo-codeword of an LDPC code with girth 4.
The same characterization results mentioned above for the lower bound of Kelley,
Sridhara, Xu, and Rosenthal are also obtained for this new lower bound on the minimum
pseudo-weight. Some LDPC codes are listed to illustrate these results. Finally, we
pose a further research problem: For a binary LDPC code $C$, construct a parity-check
matrix $H$ with the minimum number of rows such that the minimum pseudo-weight of $C$
is equal to the minimum distance of $C$, and the number of minimum edges is equal to
the number of minimum codewords of $C$, i.e., LP decoding is asymptotically optimal
for this LDPC code. So far, we do not even know whether such a parity-check
matrix exists for every binary linear code.